Genome-wide Analysis of the PHD-finger Protein Family in Arabidopsis thaliana
(College of Life and Environment Sciences, Shanghai Normal University, Shanghai 200234, China)


Abstract: PHD-finger proteins are found universally in eukaryotes and are known as key players in regulating chromatin or transcription. In animals, the structure and function of many PHD-finger proteins have been well studied. In plants, by contrast, only a few PHD-finger proteins have been functionally characterized. Using both SMART and Pfam searches, 70 members of the PHD-finger family were identified in Arabidopsis thaliana, most of which are of unknown function. This paper presents a complete bioinformatic overview of the family, including the gene structure, chromosomal location, gene expression, protein motif organization, and phylogenetic relationship of each family member. These results should provide useful information for further structural and functional studies of PHD-finger proteins.
Table 1 Members of the Arabidopsis PHD-finger family

Locus       Other names   Domain number (a)  5' End (b) (bp)  3' End (c) (bp)  Protein length (aa)
At1g05380                                    1,576,862        1,582,613        1138
At1g05830                                    1,754,243        1,761,816        1056
At1g14510                                    4,964,423        4,961,881        252
At1g14770                                    5,088,841        5,086,621        429
At1g32810                                    11,881,790       11,885,823       1068
At1g33420                                    12,123,556       12,120,866       697
At1g43770                                    16,550,881       16,552,991       431
At1g50620                                    18,752,205       18,755,006       629
At1g54390                                    20,310,691       20,308,234       328
At1g57800   VIM5                             21,415,948       21,412,288       660
At1g57820   VIM1                             21,421,611       21,417,835       645
At1g63490                                    23,555,695       23,548,321       1116
At1g66040   VIM4                             24,590,344       24,587,483       622
At1g66050   VIM2                             24,593,150       24,596,410       623
At1g66170                                    24,645,157       24,642,348       704
At1g68030                 1                  25,505,207       25,504,041       314
At1g77250                 2                  29,028,368       29,025,159       522
At1g77800                 1                  29,258,290       29,265,511       1423
At1g79350                 1                  29,858,307       29,849,548       1299
At2g01810                 1                  347,536          350,178          697
At2g02470                 1                  652,579          654,914          256
At2g19260                 1                  8,363,223        8,367,251        631
At2g25170   CHR6, PKL     1                  10,721,485       10,731,065       1384
At2g27980                 1                  11,926,818       11,920,706       1037
At2g31650   ATX1          1                  13,469,258       13,462,349       1062
At2g36720                 2                  15,400,150       15,406,497       1007
At2g37520                 2                  15,756,866       15,751,847       854
At3g01460   MBD9          2                  173,323          182,461          2176
At3g02890                 1                  641,050          645,310          963
At3g05670                 1                  1,653,270        1,657,165        883
At3g07780   OBERON1       1                  2,484,475        2,487,030        566
At3g08020                 2                  2,561,808        2,557,443        779
At3g11200                 1                  3,510,633        3,508,048        246
At3g14740                 1                  4,953,430        4,952,011        343
At3g14980                 2                  5,044,818        5,039,938        1189
At3g19510   HAT3.1        1                  6,766,570        6,762,770        723
At3g20280                 1                  7,071,199        7,074,647        743
At3g24010                 1                  8,678,227        8,675,965        234
At3g42790                 1                  14,890,769       14,888,946       250
At3g52100                 2                  19,330,321       19,333,751       696
At3g53680                 1                  19,908,390       19,903,841       839
At3g61740                 2                  22,867,947       22,861,812       1018
At4g10600                 1                  6,548,860        6,547,782        237
At4g12620   ATORC1B       1                  7,462,250        7,459,643        813
At4g14700   ATORC1A       1                  8,422,226        8,424,755        809
At4g22140                 1                  11,730,521       11,727,738       234
At4g27910                 2                  13,894,700       13,900,480       1027
At4g29940   PRHA          1                  14,653,385       14,648,145       796
At4g39100   SHL1          1                  18,220,276       18,217,933       228
At5g05610                 1                  1,679,105        1,676,933        241
At5g09790   ATXR5         1                  3,041,045        3,038,979        379
At5g12400                 1                  4,013,814        4,020,910        1595
At5g15540                 1                  5,057,414        5,047,676        1846
At5g16680                 1                  5,473,509        5,467,537        1290
At5g20510                 1                  6,942,910        6,939,815        260
At5g22260   MS1           1                  7,370,298        7,367,638        672
At5g22760                 2                  7,571,638        7,577,869        1566
At5g24330   ATXR6         1                  8,297,071        8,295,150        349
At5g26210                 1                  9,160,386        9,158,323        255
At5g35210                 2                  13,499,468       13,491,426       1576
At5g36670                 1                  14,418,721       14,423,657       1193
At5g36740                 1                  14,478,020       14,482,957       1179
At5g39550   VIM3          1                  15,857,864       15,854,477       617
At5g44800   CHR4          1                  18,110,679       18,100,646       2242
At5g48160   OBERON2       1                  19,547,542       19,545,059       574
At5g53430   ATX5          2                  21,694,372       21,700,720       1043
At5g57380   VIN3          1                  23,266,002       23,263,621       620
At5g58610                 2                  23,708,279       23,703,997       1065
At5g60410   SIZ1          1                  24,312,116       24,318,373       885
At5g63900                 2                  25,585,924       25,587,597       557
(a) The number of PHD-finger domains in a protein, as determined by both SMART and Pfam searches.
(b) The position of the 5' end of the gene on the chromosome.
(c) The position of the 3' end of the gene on the chromosome.
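
The 5' and 3' end positions in Table 1 also encode gene orientation: when the 5' end has the larger coordinate, the gene lies on the reverse strand. The following is a minimal Python sketch of how that could be read off the table. The names `GeneSpan`, `parse_position` and `gene_span` are hypothetical helpers introduced for illustration, and treating the span as inclusive of both end positions is an assumption, since the text does not say how the ends were defined.

```python
from typing import NamedTuple


class GeneSpan(NamedTuple):
    locus: str
    strand: str      # "+" if the 5' end precedes the 3' end, otherwise "-"
    length_bp: int   # genomic span, assumed inclusive of both end positions


def parse_position(text: str) -> int:
    """Convert a coordinate such as '1,576,862' (or '1, 576, 862') to an integer."""
    return int(text.replace(",", "").replace(" ", ""))


def gene_span(locus: str, five_prime: str, three_prime: str) -> GeneSpan:
    """Derive strand and genomic span from the 5' and 3' end positions."""
    start = parse_position(five_prime)
    end = parse_position(three_prime)
    strand = "+" if start <= end else "-"
    return GeneSpan(locus, strand, abs(end - start) + 1)


# Three rows copied from Table 1 (locus, 5' end, 3' end).
rows = [
    ("At1g05380", "1,576,862", "1,582,613"),
    ("At1g14510", "4,964,423", "4,961,881"),
    ("At5g44800", "18,110,679", "18,100,646"),
]

spans = [gene_span(*row) for row in rows]
for span in spans:
    print(span.locus, span.strand, span.length_bp)
```

Under these assumptions, At1g05380 comes out on the forward strand, while At1g14510 and At5g44800 come out on the reverse strand.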
Fig. 1 Structures and expression patterns of the PHD-finger genes and NJ phylogenetic tree of the PHD-finger proteins in Arabidopsis
The figure on the left shows the neighbor-joining (NJ) phylogenetic tree constructed using ClustalX 1.83, summarizing the evolutionary relationships among the 70 members of the PHD-finger protein family in Arabidopsis. The proteins are named according to their gene loci (see Table 1). Seventeen subfamilies (named with letters A to Q) with high bootstrap values (≥ 900) are separated by dashed lines in the phylogenetic tree. The numbers beside the branches represent bootstrap values based on 1000 replications. The middle figure shows the expression patterns of the PHD-finger genes. Above the columns of expression data, F refers to flower, R to root, L to leaf, and S to silique. Positive signals are indicated by gray boxes below the letters. The figure on the right shows the gene structures. The black boxes indicate exons, and the black lines between the boxes indicate introns.
Fig. 2 Multiple sequence alignment of the PHD-finger domains
Multiple sequence alignment of the 84 PHD-finger domains identified in this study. The domains are named according to their gene loci. The positions of the PHD-finger domains in the proteins are indicated by the number at the end of the domain names. The shading of the alignment presents identical residues in black and similar residues in gray. The consensus residues are shown at the bottom.
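
As an illustration of how a consensus line like the one at the bottom of Fig. 2 can be derived from aligned sequences, here is a minimal Python sketch. It is not the procedure used to draw the figure: the function `column_consensus`, the 50% threshold and the toy sequences are all hypothetical and are not taken from the paper's data.

```python
from collections import Counter


def column_consensus(aligned: list[str], min_fraction: float = 0.5) -> str:
    """Return one consensus character per alignment column.

    A residue is reported when it occupies at least `min_fraction` of the
    rows in that column; otherwise '.' is emitted. Gaps ('-') never count
    as a consensus residue.
    """
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("all aligned sequences must have the same length")
    consensus = []
    for column in zip(*aligned):
        counts = Counter(res for res in column if res != "-")
        if not counts:
            consensus.append(".")
            continue
        residue, count = counts.most_common(1)[0]
        consensus.append(residue if count / len(column) >= min_fraction else ".")
    return "".join(consensus)


# Hypothetical toy alignment, NOT real PHD-finger domain sequences.
toy_alignment = [
    "CAVCG-DGEL",
    "CSVCGKDGDL",
    "CDICG-EGEM",
    "CKVCH-DGQL",
]
print(column_consensus(toy_alignment))
```

A shading scheme that distinguishes identical from similar residues would additionally need a residue similarity grouping, which this sketch leaves out.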
Fig. 3 NJ phylogenetic tree of the PHD-finger domains in Arabidopsis
The phylogenetic tree, based on the multiple sequence alignment of PHD-finger domains, summarizes the evolutionary relationships among the PHD-finger domains in Arabidopsis. The domains are named according to their gene loci, and the positions of the PHD-finger domains in the proteins are indicated at the end of the domain names. The numbers beside the branches represent bootstrap values based on 1000 replications. The domains given the same symbols in a clade have close evolutionary relationships.
Fig. 4 Chromosomal distribution and duplication events for Arabidopsis PHD-finger genes
The chromosome number is indicated at the top of each chromosome. The five genes indicated with letters a to e are not PHD-finger genes (a: At2g18880; b: At4g04260; c: At4g14920; d: At4g30200; e: At4g31900). Gray lines connect corresponding duplicated genes. The genes involved in tandem duplication events are marked with a gray background.
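
One simple way to screen for candidate tandem pairs is to look for family members that sit close together on the same chromosome. The Python sketch below does this with 5' end positions copied from Table 1. The criterion the authors used is not stated in this section, so the 10 kb cutoff and the helpers `chromosome` and `tandem_candidates` are assumptions made for illustration, and the output has not been checked against the genes marked in Fig. 4.

```python
def chromosome(locus: str) -> str:
    """Chromosome digit of an AGI locus code, e.g. 'At1g57800' -> '1'."""
    return locus[2]


def tandem_candidates(
    genes: list[tuple[str, int]], max_gap_bp: int = 10_000
) -> list[tuple[str, str]]:
    """Pair neighbouring genes on one chromosome whose 5' ends lie within max_gap_bp.

    The distance cutoff is an assumed heuristic, not the paper's criterion.
    """
    ordered = sorted(genes, key=lambda g: (chromosome(g[0]), g[1]))
    pairs = []
    for (locus_a, pos_a), (locus_b, pos_b) in zip(ordered, ordered[1:]):
        same_chromosome = chromosome(locus_a) == chromosome(locus_b)
        if same_chromosome and pos_b - pos_a <= max_gap_bp:
            pairs.append((locus_a, locus_b))
    return pairs


# Locus codes and 5' end positions copied from Table 1.
genes = [
    ("At1g54390", 20_310_691),
    ("At1g57800", 21_415_948),
    ("At1g57820", 21_421_611),
    ("At1g63490", 23_555_695),
    ("At1g66040", 24_590_344),
    ("At1g66050", 24_593_150),
    ("At1g66170", 24_645_157),
    ("At2g01810", 347_536),
]
print(tandem_candidates(genes))
```

Because only family members are compared, this screen cannot tell whether unrelated genes lie between two candidates; a stricter test would need the full genome annotation.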
Fig. 5 Domains of proteins in some subfamilies
The protein domains in subfamilies A, B, D, H, K, P and Q, as determined by both SMART and Pfam searches, are shown. The black boxes indicate the PHD-finger domain. The white boxes numbered 1 to 19 indicate the 19 domains other than the PHD-finger domain. The names of the 19 domains are shown at the top right.